SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Stocco, Douglas M. 

Clark, Barbara J. 

(ii) TITLE OF INVENTION ; 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Akin, Gump, Strauss, Hauer & Feld, L.L. 

(B) STREET : 1900 Frost Bank Plaza, 816 Congress Avenue 

(C) CITY: Austin 

(D) STATE: TX 

(E) COUNTRY: U.S.A. 

(F) ZIP: 78701 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

(B) FILING DATE; Concurrently Herewith 

(C) CLASSIFICATION: Unknown 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Mayfield, Denise L. 

(B) REGISTRATION NUMBER: 33,732 

(C) REFERENCE /DOCKET NUMBER: 0433 75 . 0006/DLM 



COMPOSITIONS AND METHODS FOR REGULATION OF 
STEROIDOGENESIS 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 512/499-6200 

(B) TELEFAX: 512/499-6290 

5 (2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



20 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc =■ "DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GTCGACCCAC GCGTCCGCTC AGGACCTTGA AAGGCTCAGG AAGAACAACC CTTGAGCACC 60 

TCAGCACTCA GCATGTTCCT CGCTACGTTC AAGCTGTGTG CTGGAAGCTC CTATAGACAT 120 

ATGCGGAATA TGAAAGGATT AAGGCACCAA GCTGTGCTGG CCATTGGCCA AGAGCTCAAC 180 

TGGAGAGCAC TGGGGGATTC CAGTCCCGGG TGGATGGGTC AAGTTCGACG TCGGAGCTCT 24 0 

25 CTGCTTGGTT CTCAACTGGA AGCAACACTC TATAGTGACC AGGAGCTGTC CTACATCCAG 3 00 

CAGGGAGAGG TGGCTATGCA GAAGGCCTTG GGCATACTCA ACAACCAGGA AGGCTGGAAG 3 60 

AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GTGCTAAGTA AGATGGTGCC AGATGTGGGC 420 

30 

AAGGTGTTTC GCTTGGAGGT GGTGGTAGAC CAGCCCATGG ACAGACTCTA TGAAGAACTT 480 

GTGGACCGCA TGGAGGCCAT GGGAGAGTGG AACCCAAATG TCAAGGAGAT CAAGGTCCTG 54 0 

35 CAGAGGATTG GAAAAGACAC GGTCATCACT CATGAGCTGG CTGCGGCGGC AGCAGGCAAC 600 
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10 



30 



35 



(2) INFORMATION FOR SKQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acida 



660 



720 



340 



900 



CTGGTGGGGC CTCGAGACTT CGTGAGCGTG CGCTGTACCA AGCGCAGAGG TTCCACCTGT 
GTGCTGGCAG GCATGGCCAC ACATTTTGGG GAGATGCCGG AGCAGAGTGG TGTCATCAGA 

GCTGAACACG GCCCCACCTG CATGGTGCTT CATCCACTGG CTGGAAGTCC CTCCAAGACT 78 0 
AAACTCACTT GGCTGCTCAG TATTGACCTG AAGGGGTGGC TGCCGAAGAC AATCATCAAC 
CAGGTCCTAT CGCAGACCCA GATAGAGTTC GCCAACCACC TGCGCAAGCG CCTGGAAGCC 

AGCCCTGCCT CTGAGGCCCA GTGTTAAGGA CTGTCCACCA CATTGACCTG CAAATCATTG 960 

GAAGCTCTCA CAGGAAGCCT GCAAGTCTGT CCATCTTCAG CTAACAGCAT CGGGAGGGGT 1020 

15 GGTAGTCAGG AGACACTAGG ACTGACTGGT AAAATCAGGA TCAGCAAAAT AGAAATGAGG 1080 

CTTAGAATAA AAGTTCTCTA GTGTCTCCCA CTGCATAGCT GTGAAGGCTA AGGGATAAGT 1140 

AGCTATGAAA CCTTTCATCT AGGCTTGTAT ATGCTGACCT AAAAGACACC AGCAGCTACG 1200 

20 

AACAGGGGAT GCTAAGGATC GGGAACTGTT GTCTTACCAG CTCCAAATGT CACTACCTGA 126 0 

AGGCAGTGTG CACACAAAGC AAGGTCTTGC CTAGGAAACT CTGTAAAAGT TCTCCTCTGT 1320 

25 AAAAGGCCAG AACTTGAATG AAACTACCTA CAAAGGGCCT TTCCAGAGTA TTCCAACTTT 13 80 

TCTCTGAGGA GAAATGAAAC CATCATTGTG CCGACTTCCC TACTAATCCC ATGACAATAA 1440 

AGAACATACA TAAAAAAAAA AAAAAA 1466 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ila Thr Pne Lys Leu Cys Ala Gly Ser Ser Tvr Ara 



Met Phe Leu Ala Thr Pne Lya Leu Cys Ala Gly Ser Ser Tyr Arg His 

10 1 5 10 15 

,T, K H $\ L k I £ H % h V I /! I .Co 

Met Arg Asn Met Lys Gly Leu Arg His Gin Ala Val Leu Ala lie Gly 

20 25 30 

<\ t L m w f( /] , A _b c - p G V) |H 

15 Gin Glu Leu Asn Trp Arg Ala Leu Gly Asp Ser Ser Pro Gly Trp Met 

35 40 45 

Gly Gin Val Arg Arg Arg Ser Ser Leu Leu Gly Ser Gin Leu Glu Ala 

50 55 60 

20 r l v 5 .D e< fc ^ s y 1 Q Q & £ \l 

Thr Leu Tyr Ser Asp Gin Glu Leu Ser Tyr lie Gin Gin Gly Glu Val 

65 70 75 80 

/I ft Q K ft l. £, l" L a/ a/ & £ £ u) k. 

Ala Met Gin Lys Ala Leu Gly lie Leu Asn Asn Gin Glu Gly Trp Lys 

25 85 90 95 

Y £ $ Q <v /- N £ $ I V L < K A' \J 

Lys Glu Ser Gin Gin Glu Asn Gly Asp Glu Val Leu Ser Lys Met Val 

100 105 110 

P "D v C K v f fl l £ v v V 7) # /' 

30 Pro Asp Val Gly Lys Val Phe Arg Leu Glu Val Val Val Asp Gin Pro 

115 120 125 

/>/ r> K L V t £ L V h ^ ft] i: /? /// G 

Met Asp Arg Leu Tyr Glu Glu Leu Val Asp Arg Met Glu Ala Met Gly 

130 135 140 

35 
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t Iaj N f> N V i; L' i /' v / Q f I £ 

Glu Trp Asn Pro Asn Val Lys Glu He Lys Val Leu Glh Arg He Gly 

145 150 155 160 

K 2> T V 2 T H £' f ] $ / } f< /} ^ H 

Lys Asp Thr Val He Thr His Glu Leu Ala Ala Ala Ala Ala Gly Asn 

5 165 170 175 

L V Q f K Tj i V > V *L r T~ t J £ 

Leu Val Gly Pro Arg Asp Phe Val Ser Val Arg Cys Thr Lys Arg Arg 

180 185 190 

£ " f C v L (] 6 fi] /IT | If C £ /)} 

10 Gly Ser Thr Cys Val Leu Ala Gly Met Ala Thr His Phe Gly Glu Met 

195 200 205 

I' £ § < 6 \l i H A ^ W & P T c /H 

Pto Glu Gin Ser Gly Val He Arg Ala Glu His Gly Pro Thr Cys Met 

210 215 220 

V ; L il P L P 6 c > P 5 £ T t L T JO 

Val Leu His Pro Leu Ala Gly Ser Pro Ser L^s Thr Lys Leu Thr TCp 

225 230 235 240 



15 



I l s x p l « ^ u) l p k' r tf J M 

Leu Leu Se~r lie Asp Leu Lys Giy Trp Leu Pto hy& Thr He He Asn 

20 245 250 255 

ft ^ L s & I r ^ f ^ >J ft L k' 

Gin Val Leu S« Gin Thr Gin He Glu Phe Ala Asn His Lfcii At$ Lys 

260 265 270 

II L £ fl J p 4 5 ^ ft Q (L 

25 Ar^r Leu Glu Ala Ser Pro Ala Ser Glu Ala Glfa Cys 

275 280 



(2) INFORMATION FOR SEQ ID NO: 3: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 
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(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ft £ \\ Cr 9 t c ai v u H p l / 

5 Ala Glu His Gly Pro Thr Cys Met Val Leu His Pro Leu Al 

15 10 

(2) INFORMATION FOR SEQ ID NO:4: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



25 



(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

!\ l £ l l h\ a) <s e a io £ 

Ala Leu Gly lie Leu Asn Asn Gin Glu Gly Trp Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 5; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

35 




(2) INFORMATION FOR SEQ ID NO:6: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



N q € £ lx) \c 

20 Asn Gin Glu Gly Trp Lys 



(2) INFORMATION FOR SEQ ID NO: 7: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

35 



0£ 



ua Glu His Gly pro Thr Cys Met Val 



5 (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



X U N M ^ F & u) K ^ £" 

He Leu Asn Asn Gin Glu Gly Trp Lys Lys Glu 



15 10 
20 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

30 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of<3, 12, 15, 18) 

(D) OTHER INFORMATION: /modjDase- OTHER 
35 /note- "N » (A or C or G or T/U) or (unknown or other) " 



07 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base- OTHER 
5 /note- "R = A or G" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(9, 21) 

10 (D) OTHER INFORMATION: /mod_base= OTHER 

/note- " Y = C or T/U" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

15 

GCNGARCAYG GNCCNACNTG YATGG 



(2) INFORMATION FOR SEQ ID NO: 10: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "DNA" 

(ix) FEATURE: 
30 (A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(5, 17) 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note- "R » A or G n 



35 (ix) FEATURE : 
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(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(8, 11, 14, 23) 

(D) OTHER INFORMATION: /mod_base* OTHER 
/note* "N = (A or C or G or T/U) or (unknown or other) " 

5 

{ix) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /mod_base- OTHER 
10 /note- "Y = C or T/U" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCATRCANGT NGGNCCRTGY TCNGC 2 5 

15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 <ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "DNA" 

(ix) FEATURE : 

(A) NAME/KEY: modif ied_base 
30 (B) LOCATION: 3 

(D) OTHER INFORMATION: /modjaase- OTHER 
/note* "Y * C or T/U" 

(ix) FEATURE: 
35 (A) NAME/KEY: modif ied_base 



/note= 



(B) LOCATION: one-of(6, 9) 
(D) OTHER INFORMATION: /mod_base= OTHER 
"R ■ A or G" 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION : 12 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note* "N » (A or C or G or T/U) or (unknown or 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAYCARCARG GNTGGAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - " DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

TTCCANCCYT CYTGRTT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 

-90- 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION t /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

10 AACCAGGAAG GCTGGAAGAA GGAAAGCCAG CAGGAGAACG GGGACGAAGT GCTAAGTAAG 60 

ATGGTGCCAG ATGTGGGCAA GGTGTTTCGC TTGGAGGTGG TGGTAGACCA GCCCATGGAC 120 



15 



AGACTCTATG AAGAACTTGT GGACCGCATG GAGGCCATGG GAGAGTGGAA CCCAAATGTC 180 



AAGGAGATCA AGGTCCTGCA GAGGATTGGA AAAGACACGG TCATCACTCA TGAGCTGGCT 240 



GCGGCGGCAG CAGGCAACCT GGTGGGGCCT CGAGACTTCG TGAGCGTGCG CTGTACCAAG 3 00 

20 CGCAGAGGTT CCACCTGTGT GCTGGCAGGC ATGGCCACAC ATTTTGGGGA GATGCCGGAG 360 



CAGAGTGGTG TCATCAGAGC TGAACACGGC CCCACCTGCA T 4 01 



25 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "RNA" 

35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GUCGACCCAC GCGUCCGCUC AGGACCUUGA AAGGCUCAGG AAGAACAACC CUUGAGCACC 60 

5 UCAGCACUCA GCAUGUUCCU CGCUACGUUC AAGCUGUGUG CUGGAAGCUC CUAUAGACAU 12 0 

AUGCGGAAUA UGAAAGGAUU AAGGCACCAA GCUGUGCUGG CCAUUGGCCA AGAGCUCAAC 18 0 

UGGAGAGCAC UGGGGGAUUC CAGUCCCGGG UGGAUGGGUC AAGUUCGACG UCGGAGCUCU 24 0 

10 

CUGCUUGGUU CUCAACUGGA AGCAACACUC UAUAGUGACC AGGAGCUGUC CUACAUCCAG 300 

CAGGGAGAGG UGGCUAUGCA GAAGGCCUUG GGCAUACUCA ACAACCAGGA AGGCUGGAAG 3 60 

15 AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GUGCUAAGUA AGAUGGUGCC AGAUGUGGGC 42 0 

AAGGUGUUUC GCUUGGAGGU GGUGGUAGAC CAGCCCAUGG ACAGACUCUA UGAAGAACUU 480 

GUGGACCGCA UGGAGGCCAU GGGAGAGUGG AACCCAAAUG UCAAGGAGAU CAAGGUCCUG 540 

20 

CAGAGGAUUG GAAAAGACAC GGUCAUCACU CAUGAGCUGG CUGCGGCGGC AGCAGGCAAC 600 

CUGGUGGGGC CUCGAGACUU CGUGAGCGUG CGCUGUACCA AGCGCAGAGG UUCCACCUGU 660 

25 GUGCUGGCAG GCAUGGCCAC ACAUUUUGGG GAGAUGCCGG AGCAGAGUGG UGUCAUCAGA 72 0 

GCUGAACACG GCCCCACCUG CAUGGUGCUU CAUCCACUGG CUGGAAGUCC CUCCAAGACU 78 0 

AAACUCACUU GGCUGCUCAG UAUUGACCUG AAGGGGUGGC UGCCGAAGAC AAUCAUCAAC 840 

30 

CAGGUCCUAU CGCAGACCCA GAUAGAGUUC GCCAACCACC UGCGCAAGCG CCUGGAAGCC 900 

AGCCCUGCCU CUGAGGCCCA GUGUUAAGGA CUGUCCACCA CAUUGACCUG CAAAUCAUUG 960 

35 GAAGCUCUCA CAGGAAGCCU GCAAGUCUGU CCAUCUUCAG CUAACAGCAU CGGGAGGGGU 1020 



GGUAGUCAGG AGACACUAGG ACUGACUGGU AAAAUCAGGA UCAGCAAAAU AGAAAUGAGG 



1080 



CUUAGAAUAA AAGUUCUCUA GUGUCUCCCA CUGCAUAGCU GUGAAGGCUA AGGGAUAAGU 1140 

AGCUAUGAAA CCUUUCAUCU AGGCUUGUAU AUGCUGACCU AAAAGACACC AGCAGCUACG 12 00 

AACAGGGGAU GCUAAGGAUC GGGAACUGUU GUCUUACCAG CUCCAAAUGU CACUACCUGA 1260 

AGGCAGUGUG CACACAAAGC AAGGUCUUGC CUAGGAAACU CUGUAAAAGU UCUCCUCUGU 1320 

AAAAGGCCAG AACUUGAAUG AAACUACCUA CAAAGGGCCU UUCCAGAGUA UUCCAACUUU 1380 

UCUCUGAGGA GAAAUGAAAC CAUCAUUGUG CCGACUUCCC UACUAAUCCC AUGACAAUAA 1440 

AGAACAUACA UAAAAAAAAA AAAAAA 1466 



(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE : 

(C) STRAND ED NESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE : 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO:15: 

267 GTGGATTAAC CAGGTTCGGC GGCGGAGCTC TCTACTCGGT TCTCGGCTGG AAGAGACTCT 326 
327 CTACAGTGAC CAGGAGCTGG CCTATCTCCA GCAGGGGGAG GAGGCCATGC AGAAGGCCTT 3 86 
387 GGGCATCCTT AGCAAC CAAG AGGGCTGGAA GAAGGAGAGT CAGCAGGACA ATGGGGACAA 446 
447 AGTGATGAGT AAAGTGGTCC CAGATGTGGG CAAGGTGTTC CGGCTGGAGG TCGTGGTGGA 506 
507 CCAGCCCATG GAGAGGCTCT ATGAAGAGCT CGTGGAGCGC ATGGAAGCAA TGGGGGAGTG 566 



567 GAACCCCAAT GTCAAGGAGA TCAAGGTCCT 

627 TCACGAGCTG GCTGCCGAGG CAGCAGGAAA 

687 GCGCTGTGCC AAGCGCCGAG GCTCCACCTG 

747 GAACATGCCT GAGCAGAAGG GTGTCATCAG 

8 07 TCACCCGTTG GCTGGAAGTC CCTCTAAGAC 

867 CAAGGGGTGG CTGCCCAAGA GCATCATCAA 

92 7 TGCCAACCAC CTGCGCAAGC GCCTGGAGTC 

987 AC 988 



GCAGAAGATC GGAAAAGATA CATTCATTAC 626 
CCTGGTGGGG CCCCGTGACT TTGTGAGCGT 686 
TGTGCTGGCT GGCATGGACA CAGACTTCGG 746 
GGCGGAGCAC GGTCCCACTT GCATGGTGCT 8 06 
CAAACTTACG TGGCTACTCA GCATCGACCT 86 6 
CCAGGTCCTG TCCCAGACCC AGGTGGATTT 926 
CCACCCTGCC TCTGAAGCCA GGTGTTGAAG 986 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRAND EDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO:16: 

127 ATGCTGCTAG CGACATTCAA GCTGTGCGCT GGGAGCTCCT ACAGACACAT GCGCAACATG 186 
187 AAGGGGCTGA GGCAACAGGC TGTGATGGCC ATCAGCCAGG AGCTGAACCG GAGGGCCCTG 246 
247 GGGGGCCCCA CCCC 260 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRAND EDNESS : 

(D) TOPOLOGY: 
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5 



10 



30 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

10 51 ACTGGAAGCC TGCAAGTCT 106 9 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 
15 (D) TOPOLOGY: 

(ii) MOLECULE TYPE : 

(A) DESCRIPTION: 

20 SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

1 MLLATFKLCA GSSYRHMRNM KGLRQQAVMA ISQELNRRAL GGPTPSTWIN 50 

51 QVRRRSSLLG SRLEETLYSD QELAYLQQGE EAMQKALGIL SNQEGWKKES 100 

101 QQDNGDKVMS KWPDVGKVF RLEWVDQPM ERLYEELVBR MEAMGEWNPN 150 

25 151 VKEIKVLQKI GKDTFITHEL AAEAAGNLVG PRDFVSVRCA KRRGSTCVLA 200 

201 GMATDFGNMP EQKGVIRAEH GPTCMVLHPL AGSPSKTKLT WLLSIDLKGW 250 

251 LPKSIINQVL SQTQVDFANHL RKRLESHPA SEARC 285 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 
35 (D) TOPOLOGY: 
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10 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

agaacaccag gtccaggctg cagctgcggg actcagaggc gaacgttgag gggctcagga 60 
aggacgaaga accacccttg agagaagagg cagcagcagc gcggcagcag cagcggcagc 120 
gaccccacca ctgccacatt tgccaggaaa caATGCTGCT AGCGACATTC AAGCTGTGCG 180 
CTGGGAGCTC CTACAGACAC ATGCGCAACA TGAAGGGGCT GAGGCAACAG GCTGTGATGG 240 
CCATCAGCCA GGAGCTGAAC CGGAGGGCCC TGGGGGGCCC CACCCCTAGC ACGTGGATTA 300 
ACCAGGTTCG GCGGCGGAGC TCTCTACTCG GTTCTCGGCT GGAAGAGACT CTCTACAGTG 360 
ACCAGGAGCT GGC CTATCTC CAGCAGGGGG AGGAGGCCAT GCAGAAGGCC TTGGGCATCC 420 
TTAGCAACCA AGAGGGCTGG AAGAAGGAGA GTCAGCAGGA CAATGGGGAC AAAGTGATGA 480 
GTAAAGTGGT CCCAGATGTG GGCAAGGTGT TCCGGCTGGA GGTCGTGGTG GACCAGCCCA 540 
15 TGGAGAGGCT CTATGAAGAG CTCGTGGAGC GCATGGAAGC AATGGGGGAG TGGAACCCCA 600 

ATGTCAAGGA GATCAAGGTC CTGCAGAAGA TCGGAAAAGA TACATTCATT ACTCACGAGC 660 
TGGCTGCCGA GGCAGCAGGA AACCTGGTGG GGCCCCGTGA CTTTGTGAGC GTGCGCTGTG 720 
CCAAGCGCCG AGGCTCCACC TGTGTGCTGG CTGGCATGGC CACAGACTTC GGGAACATGC 780 
CTGAGCAGAA GGGTGTCATC AGGGCGGAGC ACGGTCCCAC TTGCATGGTG CTTCACCCGT 840 
TGGCTGGAAG TCCCTCTAAG ACCAAACTTA CGTGGCTACT CAGCATCGAC CTCAAGGGGT 900 
GGCTGCCCAA GAGCATCATC AACCAGGTCC TGTCCCAGAC CCAGGTGGAT TTTGCCAACC 960 
ACCTGCGCAA GCGCCTGGAG TCCCACCCTG CCTCTGAAGC CAGGTGTTGA agaccagcct 1020 
gctgttccca actgtgccca gctgcactgg tacacacgct catcaggaga atccctactg 1080 
gaagcctgca agtctaagat ctccatctgg tgacagtggg atgggtgggg ttcgtgttta 1140 
gagtatgaca ctaggattca gattggtgaa agtttttagt accaagaaaa cagggatgag 1200 
ctcttggatt aaaaggtaac ttcattcact gattagctat gacatgaggg ttcaggcccg 1260 
ctaaaaataa ttgtaaaact ttttttctgg gcccttatgt acccacctaa aaccatcttt 1320 
aaaatgctag tggctgatat gggtgtgggg gatgctaacc acagggcctg agaagtcttg 13 8 0 
ctttatgggc tcaagaatgc catgcgctgg cagtacatgt gcacaaagca gaatctcaga 1440 
gggtctcctg cagccctctg ctcctcccgg ccgctgcaca gcaacaccac agaacaagca 1500 
gcaccccaca gtgggtgcct tccagaaata tagtccaagc tttctctgtg gaaaaagaca 1560 
aaactcatta gtagacatgt ttccctattg ctttcatagg caccagtcag aataaagaat 1620 
cataattcac acaaaaaaaa a 1641 



20 



25 



30 
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